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Abstract — We address the distributed estimation of an un- 
linown scalar parameter in Wireless Sensor Networlis (WSNs). 
Sensor nodes transmit their noisy observations over multiple 
access channel to a Fusion Center (FC) that reconstructs 
the source parameter. The received signal is corrupted by 
noise and channel fading, so that the FC objective is to 
minimize the Mean-Square Error (MSE) of the estimate. In 
this paper, we assume sensor node observations to be correlated 
with the source signal and correlated with each other as 
well. The correlation coefficient between two observations is 
exponentially decaying with the distance separation. The effect 
of the distance-based correlation on the estimation quality is 
demonstrated and compared with the case of unity correlated 
observations. Moreover, a closed-form expression for the outage 
probability is derived and its dependency on the correlation 
coefficients is investigated. Numerical simulations are provided 
to verify our analytic results. 

I. Introduction 

Recent advances of micro-sensor fabrication technology 
allow for producing cheap and small sensor nodes with 
wireless communication capabilities. Consequently, Wireless 
Sensor Networks (WSNs) become an economically sound 
solution to wide range of applications such as environmental 
and wildlife habitat monitoring, target tracking for defense 
purposes, and health care [1]. Typical WSN consists of large 
number of sensor nodes deployed in an area of interest to col- 
lect specific information about the surrounding environment. 
The need for large number of sensor nodes in WSNs while 
being cost-effective constraints the industry standards to 
produce battery-powered sensor nodes with simple hardware. 
As a result of the limited energy and processing capabilities 
of sensor nodes, the collected information has to be sent to 
'a Fusion Center (FC) for centralized processing. 
■ One important application of WSNs is the distributed 
estimation of scalar parameters (see, e.g., [2], and references 
therein). In such application, sensor nodes transmit their 
observations over a Multiple Access Channel (MAC) to the 
FC. The received signal is distorted by the channel fading 
and the additive noise. The FC is required to reconstruct the 
source parameter with minimum Mean-Square Error (MSE). 
Depending on the available information about the source 
statistics, different estimators can be used to achieve the MSE 
criterion. The performance of the Best Linear Unbiased Esti- 
mation (BLUE) [3], Minimum Mean Squared Error (MMSE) 
estimator [4], [5], and Maximum Likelihood Estimator 
(MLE) [6], [7] are studied in literature. Both orthogonal 



MAC [8] and coherent MAC [9], [10] are considered in the 
distributed estimation problem. Assuming Gaussian source 
signal and noise, amplify-and-forward schemes significantly 
outperform the traditional source-channel coding for both 
multiple access channels [11]. Optimal power allocation 
for sensor nodes under different constraints is addressed in 
[3], [4], [8]. Asymptotic behavior of the distortion is also 
studied in [12]. The MSE performance for the coherent MAC 
asymptotically approaches to zero as the number of sensors 
increases to infinity. However, this is not the case for the 
orthogonal MAC where the MSE reaches a finite non-zero 
value as the number of sensor nodes increases [9]. Diversity 
order of estimation distortion is introduced in [3] and shown 
to be given by the number of sensors. 

In most WSN applications, the source parameter is a 
physical quantity like temperature, pressure, humidity, sound, 
... etc. Therefore, the sensor node observations are correlated 
where the correlation coefficient is exponentially decaying 
with distance. In literature, simple signal models were usu- 
ally assumed. For example, unity correlated observations 
are assumed in [3], where sensor nodes measure a noisy 
version of the source signal. The correlation between the 
observations and the source signal (denoted hereafter as the 
source-node correlation) is considered in [3]. However, in 
this model the correlation between observations (denoted 
hereafter as the inter-node correlation) is determined by the 
source-node correlation as we will show later. Considering a 
correlation model with inter-node correlation that determined 
by the distance between sensor nodes is a more realistic 
assumption [10], [13]. 

In this paper, we study the distributed estimation of a 
scalar parameter where sensor nodes transmit their obser- 
vations to the FC over a coherent MAC. The observations 
are spatially correlated and corrupted by noise. Moreover, 
the communication channel is subject to fading and Additive 
White Gaussian Noise (AWGN). The FC uses the received 
signal to estimate the source parameter using LMMSE esti- 
mator. The distance-based correlation model of [14] is used 
in this paper to characterize the source-node correlation and 
the inter-node correlation. The effect of the distance-based 
correlation on the estimation performance is demonstrated 
and compared with the case of unity correlated observations. 
The outage probability is adopted as the performance mea- 
sure. A new closed-form expression for the outage probabil- 



ity in terms of quadratic forms is introduced. It is shown that 
less correlated observations degrade the performance. 

Hereafter, small letters, bold small letters, and bold capital 
letters will designate scalars, vectors, and matrices, respec- 
tively. If A is a matrix, then A^, A^, and eig (A) denote 
the hermitian, the transpose, and the eigenvalues of A, 
respectively. We define diag(a) to be a diagonal matrix 
formed from vector a. The function [-J returns the same 
value for the positive values and zero for the negative values 
and the function max(-) returns the maximum value. 

II. System Model 

Consider a WSN consisting of N sensor nodes and a FC 
as shown in Fig. 1. Sensor nodes are required to observe a 
scalar parameter modeled by a zero-mean complex Gaussian 
random variable s ^ CJ\f{Q, a'^). The signals measured by 
individual sensor nodes can be described as 



x.i = Si + Ui, i = 1, . . . ,iV, 



(1) 



where Si is the ith sensor node observation and rii ^ 
CJ\f{0, cr^J is the observation noise. The signals s and 
Si,i = 1, . . . , N, are modeled as zero-mean joint Gaussian 
random variables, i.e. E{si} = 0,i = 1,...,A^, E{ssi} = 
piag,i — I, . . . ,N, where pi is the source-node correlation 
coefficient between s and Si. Moreover, sensor node obser- 
vations Si,i ~ l,...,N, are correlated to each other, i.e. 
EjSiSj} = prj(Tl,i,j = 1,...,A^, i ^ j, where p.^ is the 
inter-node correlation coefficient between Si and Sj . 

The correlation coefficients are non-negative and decrease 
monotonically with distance. Using the power exponential 
model presented in [14], pi and py are functions of di and 
dij, respectively, according to the relation 



pid) 



h >O,O<02<2,de{d„dy}, (2) 



where di is the distance between the event source and the 
ith node, dij is distance between sensor i and j, pa = 1, 
and 6*1 and 62 are the model parameters, where 61 normalizes 
the distance and O2 controls the correlation decay rate. Let us 
define the matrix C as the cross-node correlation matrix with 
Pij is the element at the ith row and jth column. Assuming 
random sensor node locations, C will be a full-rank matrix, 
and therefore this model will be referred to as full-rank cor- 
relation model. The aforementioned model for the correlation 
coefficients pi and pij is more generalized than the one 
used in [3]-[5] where the signal is given as Xi{t) — s{t) + 
Vi{t),i — 1, . . . ,N, and thus Si{t) = s{t) in this case. This 
results in unity correlated source and observations, i.e. pi = I 
and Pij = 1, i,j = 1,. . . ,N. Also, the signal model used 
in [9], [15] is x,{t) = h,s{t) +Vt{t),i = l,...,iV, which 
corresponds to Si{t) = his{t) in Eq. ([T]i and results in the 
special case pij — piPj where E{s{t)si{t)} = a'^hi — cr^pi 
and E{si(t)sj(i)} — a'^hihj = a^piPj. Let us define the 
vector Y —[pi p2 ... pn] , then the cross-node correlation 
matrix for the later case is given by C = rr^ which is a 
rank one matrix and thus will be referred to as rank-one 
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Fig. 1. System model. 



correlation model. In the rest of the paper, we will compare 
between the three correlation models. 

For the amplify-and-forward scheme, the transmitted sig- 
nal from the ith sensor node is given by yi — aiXi,i = 
1,...,N, where Oj is the amplification factor Here, the 
transmit power for each node is E{|y,:p} = af{cr'^ + c^). 
Assume that the Channel State Information (CSI) is available 
at the FC. Then, the coherent combining of the transmitted 
signals received at the FC is 



N 

z = ^ a^giXi + I/, 



(3) 



where i' is the communication noise i' ^ CJ\f{Q, al) and gi 
is the Rayleigh fading for the ith node gi ^ CJ\f{0, a^). 

Given the signal and channel statistics, the LMMSE esti- 
mate s can be expressed as 



N 

zo-1 E a^giPi 

i=l 



*^ E{z2}^~ JV N N 

'^sT, T, a^a-jgigjp^j + J2 (^lajgf + a^ 
and the corresponding distortion becomes 
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(5) 



The normalized distortion D — ^ is then given by Eq. (|6]l 
(shown at the bottom of the next page). This expression is 
the generalization of the equivalent one given in [5, Eq. 2] 
for the unity correlated source and observations. The first 
term in the numerator and the first term of the denominator 
in Eq. (|6|l are the result of the inter-node correlation. 

III. Effect of the correlation on the distortion 

In this section, the special cases of unity correlation 
and rank-one model are compared to full-rank model. For 
simplicity, the channel fading is neglected, i.e. gi — l,i = 
1,...,A^, and Equal Power Allocation (EPA) is assumed. 



i.e. Gi = y^Ptot/N{aj +cr^) = a, where Ptot is the 
total transmit power for all sensor nodes. Accordingly, the 
normalized distortion expression for the full-rank model 
reduces to 

f)FR _ 
Uq — 

/ N N 



'^W E E p.. 



1=1 j=i 




/ N N \ 

'^W iEEp^A + NaW + <yi 



(8) 



Considering the signal model in [9, Eq. 10], which results 
in the special case pij = piPj (rank-one model), the 
corresponding normalized distortion is given by 
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Fig. 2. The behavior of the normalized distortion as the number of sensor 
(9) nodes A'^ increases for the three coiTelation models {d\ = 250, 62 = 1> 
<j'll(y'i = 20 dB, aljal = 20 dB, cr^ = 1, and Ptot = 10 dB). 



Finally, for unity correlated source and observations, i.e. 
Pi = I and Pij = 1, Vi,j, the normalized distortion 
expression reduces to 
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(10) 



which is equivalent to the expression in [5, Eq. 2]. 

Comparing the aforementioned expressions, it is apparent 
that D^'~' and Dq — > as the number of sensor nodes 
goes to infinity. However, Dq^ does not vanish under the 
same condition. The effect of correlation on the distortion 
when the number of sensor nodes increases is depicted in 
Fig. 2. Eqs. ^, (|9|l, and ( fTOl i are averaged over 1000 random 
realization of sensor node locations and plotted for increasing 
N. Here, the observation Signal-to-Noise Ratio (SNR) is 
defined as a-'^/cr'^ = 20 dB and the communication SNR 
as (CTs + a'^)/a1 = 20 dB, where cr^ = 1. The coiTelation 
model has di = 250 and O2 = I- The total transmit power 
of all sensor nodes is Ptot = 10 dB (normalized to a^). As 
expected, the distortion for the unity correlated case tends 
to zero as N increases. The same behavior is noticed for 
the rank-one model, however with slightly higher distortion 
in this case. Conversely, the distortion for full-rank model 
exhibit a floor behavior at a non-zero distortion (w 0.182). 
Therefore, correlation between observations should be taken 
into consideration when designing distributed estimation 
schemes because weak correlation degrades its performance. 



IV. Outage probability analysis 

The normaUzed distortion in Eq. (jSJ depends on the chan- 
nel fading, correlation coefficients, channel and measurement 
noise power, and amplification factors. Given the distance- 
based correlation model, the correlation coefficients will be 
invariable for specific network geometry. Therefore, with 
fixed power allocation and noise power, the channel fading 
is the only determining factor for the fluctuation in the 
estimation distortion. In many applications, the interest is to 
characterize the maximum distortion rather than the average 
distortion. In such situations, the outage probability is used as 
a performance measure where the outage refers to the event 
at which a desired performance level cannot be satisfied. 
Here, the outage probability is defined as the probability that 
the normalized distortion exceeds a certain value S, 



Pout(5)=Pr(^>^). 



(11) 



To find a closed-form expression for the outage prob- 
ability, we first express the distortion in vector form. 
Let us define the vectors g = [gi §2 ■ ■ ■ gN] , sl ~ 
[fli 02 . . . Oat] , and r and C as defined previously. It 
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follows that J2 0'i9iPi = z^g, where z = Wr and W = 
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g^Bg = ||g|||, where B = WCW^ and '£<^l9i = 

4 = 1 

g^W^g — ||g||^2- Then the normahzed distortion can be 
expressed in terms of indefinite quadratic forms as 

,2 



D = 



\s\\h. 



&IIB2 



(12) 



where Bi =a'^^{B-F) + cr^ W^ and B2 = cr^B + cr^ W^. 
Accordingly, the outage probability takes the form 
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where E{5) — S 'B2 — Bi. Using the results of [16], the 
outage probability Pout{S) can be expressed as 



Pn, 



t(^)="(( 



1~S\ct: 



N 

E 



(-A,) 



TV 



1 



- n (A^ - AO A, 



_ (1-5)0 



1-(5U 



xe 



A/ 



(14) 



depends on the eigenvalues of E{S) (exact expression) or 
the eigenvalues of F and B (lower bound). 
Assume equal power allocation, one has 
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where X+{5) = max ( eig ( rr^ -11-5) C ) ) /iV is the 



factor reflecting the effect of the correlation on the largest 
eigenvalue and it depends on the geometry of the WSN. 
Considering that the term PtotCs/fcr^ + o'^) is constant for 
specific network setting, the factor A+((5) will be referred to 
as the normalized eigenvalue. The outage probability in this 
case is expressed as 



PoutiS) = 1 



A 



Af-l 



n (A+ - A.) 



xexp 



{a'^+al){l-5)al 



,0<5<1. (19) 



tot"s'^+ 



where A; = Xi{S),i = 1, . . . ,N, are the eigenvalues of the 
matrix E(^) and u() is the Heaviside unit-step function. 
This expression can be simplified if E is substituted by its 
component matrices. 



where the last approximation results from considering that 
cr^j ^ (T^. Recall that the matrix F is the outer product of 
the vector z by itself, hence it is of rank one and positive 
semidefinite. Moreover, B is also positive semidefinite ma- 
trix. Therefore, E has only one non-negative eigenvalue and 
all other eigenvalues can simply canceled off in (fl4l i because 
of the unit-step function. Finally, the expression of outage 
probability simplifies to 



AK^)=eig (e(J)) = eig (5 B2 - Bi^ 

=eig (afF - af (1 - ~5)B - ^^(l - ~6)W''^ 
^a^eig (f - (1 - 5)b) , Z = l,...,iV, (15) 
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(16) 



where A+ = A+((5) is the only non-negative eigenvalue of 
E(i5). Using Weyls inequality [17], this eigenvalue can be 
lower bounded by 
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(17) 



where A^„ and A^.,^ are the largest eigenvalues of the 
matrices F and B, respectively. The outage probabiHty 



Clearly, the outage probability is a monotonically decreasing 
function of A_|_((5) and thus related to the signal and obser- 
vations correlation. 



V. Numerical Results 

In this section, the analytic results are confirmed by 
numerical simulations. Consider a WSN that consists of A^ = 
10 sensor nodes randomly located in a square area with side 
length of 20 m. It is required to estimate a source parameter 
located 30 m away from the center of the sensor nodes. 
The settings of Fig. 2 are assumed for all the following 
simulations unless otherwise stated. Moreover, the channel 
fading has variance a^ = 1. All simulation results are 
averaged over 1000 independent runs. 

Fig. 3 shows the outage probability vs. the normalized 
distortion. The analytic expressions are plotted with solid, 
dashed, and dotted lines and the simulations are plotted with 
circle, diamond, and plus marks. The closed-form expression 
is shown to be in perfect match with the simulation results 
and the accuracy of the closed-form approximation of the 
outage probability is verified. The correlation clearly affects 
the outage performance where more outage occurs for both 
distance-based correlation models. Note that for the full-rank 
model. Pout — 1 for D with values less than « 0.182 (i.e. 
the distortion is always larger than this value) which agrees 
with the results of Fig. 2. 

Fig. 4 compares between the exact expression for the 
largest eigenvalue (normalized by Ji^Ji ) and its approxi- 
mation in Eq. (17) for different source locations. The source 
distance to the center of the square area is set to 50 m, 30 m. 
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Fig. 3. Outage probability vs. nomialized distortion (0i = 250, 62 = 1, 
(7^(71 = 20 dB, 0-2/0-2 = 20 dB, o^ = 1, and Ptot = 10 dB). 
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and m. Note that A+ is upper bounded by 
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Recall that the outage probability is a monotonically 
decreasing function of A+((5) and situations with larger 
eigenvalue corresponds to a better outage performance (i.e. 
lower outage probability). From the figure, we conclude that 
observing farther source reduces the largest eigenvalue and 
thus increases the outage probability. 
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VI. Conclusions 

In this paper, distributed estimation of a scalar param- 
eter in WSNs is considered. Correlated source signal and 
observations are assumed and the effect of correlation is 
investigated. A closed-form expression for the outage prob- 
ability is derived to link between the correlation and the 
outage performance. It is shown that higher distortion levels 
occurs with higher probability when assuming correlated 
observations as compared to unity correlated ones. Moreover, 
the distortion does not vanish when increasing the number of 
sensor nodes indefinitely for the distance-based correlation. 
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